Indoor/Outdoor Audio Classification Using Foreground Speech Segmentation
نویسندگان
چکیده
The task of indoor/ outdoor audio classification using foreground speech segmentation is attempted in this work. Foreground speech segmentation is the use of features to segment between foreground speech and background interfering sources like noise. Initially, the foreground and background segments are obtained from foreground speech segmentation by using the normalized autocorrelation peak strength (NAPS) of the zero frequency filtered signal (ZFFS) as a feature. The background segments are then considered for determining whether a particular segment is an indoor or outdoor audio sample. The mel frequency cepstral coefficients are obtained from the background segments of both the indoor and outdoor audio samples and are used to train the Support Vector Machine (SVM) classifier. The use of foreground speech segmentation gives a promising performance for the indoor/ outdoor audio classification task.
منابع مشابه
Moving object segmentation by background subtraction and temporal analysis
In this paper, we address the problem of moving object segmentation using background subtraction. Solving this problem is very important for many applications: visual surveillance of both in outdoor and indoor environments, traffic control, behavior detection during sport activities, and so on. All these applications require as a first step, the detection of moving objects in the observed scene...
متن کاملFusing Complementary Operators to Enhance Foreground/Background Segmentation
Foreground/background segmentation is an active research area for moving object analysis. We combine two probabilistic approaches one of which estimates foreground/background probabilistic density and the other uses prior knowledge to decompose the colour space. The observed performance advantages are associated with the fusion of operators with completely different basis. Tests on outdoor and ...
متن کاملOn the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks
A video‘s soundtrack is usually highly correlated to its content. Hence, audio-based techniques have recently emerged as a means for video concept detection complementary to visual analysis. Most state-of-the-art approaches rely on manual definition of predefined sound concepts such as “engine sounds”, “outdoor/indoor sounds”. These approaches come with three major drawbacks: manual definitions...
متن کاملEnhanced foreground segmentation and tracking combining Bayesian background, shadow and foreground modeling
In this paper we present a foreground segmentation and tracking system for monocular static camera sequences and indoor scenarios that achieves correct foreground detection also in those complicated scenes where similarity between foreground and background colours appears. The work flow of the system is based on three main steps: An initial foreground detection performs a simple segmentation vi...
متن کاملRobust Foreground Detection in Videos Using Adaptive Color Histogram Thresholding and Shadow Removal
Fundamental to advance video processing such as object tracking, gait recognition and video indexing is the issue of robust background and foreground segmentation. Several methods have been explored regarding this application, but they are either time or memory consuming or not so efficient in segmentation. This paper proposes an accurate and fast foreground detection technique for object track...
متن کامل